Bidirectional Attention Flow for Machine Comprehension
نویسندگان
چکیده
Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query. Recently, attention mechanisms have been successfully extended to MC. Typically these methods use attention to focus on a small portion of the context and summarize it with a fixed-size vector, couple attentions temporally, and/or often form a uni-directional attention. In this paper we introduce the Bi-Directional Attention Flow (BIDAF) network, a multi-stage hierarchical process that represents the context at different levels of granularity and uses bidirectional attention flow mechanism to obtain a query-aware context representation without early summarization. Our experimental evaluations show that our model achieves the state-of-the-art results in Stanford Question Answering Dataset (SQuAD) and CNN/DailyMail cloze test.
منابع مشابه
Start and End Interactions in Bidirectional Attention Flow for Reading Comprehension
The reading comprehension machine learning task involves reading in a question and returning an answer from an associated context paragraph. This task has proven to be difficult, as the performance of state-of-the-art models still do not compare with human performance. The difficulty of the tasks comes from understanding two separate pieces of information as well as the relationship between the...
متن کاملEnsemble Learning For Machine Comprehension: Bidirectional Attention Flow Models
In this paper, we will explore machine comprehension in Stanford Question and Answering Dataset using ensembled deep recurrent neural networks with bi-directional attention flow. Given a context paragraph, we attempt to answer a query related to the context paragraph. This requires use to not only generate knowledge representation for each question and paragraph, but also create mechanisms that...
متن کاملCS224n Assignment 4: Machine Comprehension with Exploration on Attention Mechanism
This goal of this paper is to perform the prediction task on SQuAD dataset about reading comprehension. Given a pair of context paragraph and a question, we’ll output an answer. To do this, a model is built combining the idea of Bidirectional LSTM and attention flow mechanism. The basic architecture and setup details of the model are introduced, so do the summary of performance and error analys...
متن کاملAttention-based Recurrent Neural Networks for Question Answering
Machine Comprehension (MC) of text is an important problem in Natural Language Processing (NLP) research, and the task of Question Answering (QA) is a major way of assessing MC outcomes. One QA dataset that has gained immense popularity recently is the Stanford Question Answering Dataset (SQuAD). Successful models for SQuAD have all involved the use of Recurrent Neural Network (RNN), and most o...
متن کاملA Convolutional Network Approach to Machine Comprehension
Machine Comprehension is a daunting task, since it requires cross-encoding and exchanging information between a context paragraph and a given query in order to produce an answer span. In designing baselines for a machine comprehension model, each model training has a long turnover, which does not bode well when there is limited time to train. Long runtimes are often from implementing recurrent ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1611.01603 شماره
صفحات -
تاریخ انتشار 2016